This folder contains the code for the paper 'The Curse Revisited: When are Distances Informative for the Ground Truth in Noisy High-Dimensional Data?', presented at The 25th International Conference on Artificial Intelligence and Statistics (AISTATS), 2022.
arxiv submission: https://arxiv.org/abs/2109.10569.
- R (Rstudio)
- Python (Jupyter notebook)
- ggplot2 (general plotting)
- igraph (working with graphs)
- ggpubr (combining plots)
- FNN (fast k-NN graph computation)
- parallel (parallel processing in R)
- umap (umap dimensionality reduction)
- diffusionMap (diffusion map dimensionality reduction)
- rpca (Robust PCA dimensionality reduction)
- dimRed (Isomap dimensionality reduction)
- numpy (handling arrays)
- pandas (handling data frames)
- pytorch (neural networks in Python)
- matplotlib (plotting)
- Cell trajectory: https://zenodo.org/record/1443566 (also available from "Data" folder)
- Folder "Scripts": contains the code (R) for producing the cell trajectory visualization prior to the experiments section in the main paper.
- Folder "Experiments": contains all code (R + Python) for producing the results in the experiments section of the main paper, with files named accordingly.
- Open file in Rstudio
- Source --> Source with Echo (ctrl + shift + enter).
- Open file in Jupyter notebook
- Cell --> Run Cells (ctrl + enter).
- Folder "OutputPDF": contains all output by code block in PDF format, with files named accordingly.